PATTERN AVOIDANCE IN BINARY TREES 



ERIC S. ROWLAND 

Abstract. This paper considers the enumeration of trees avoiding a contigu- 
ous pattern. We provide an algorithm for computing the generating function 
that counts n-leaf binary trees avoiding a given binary tree pattern t. Equipped 
with this counting mechanism, we study the analogue of Wilf equivalence in 
which two tree patterns are equivalent if the respective n-leaf trees that avoid 
them are equinumerous. We investigate the equivalence classes combinato- 
rially. Toward establishing bijective proofs of tree pattern equivalence, we 
develop a general method of restructuring trees that conjecturally succeeds to 
produce an explicit bijection for each pair of equivalent tree patterns. 



1. Introduction 

Determining the number of words of length n on a given alphabet that avoid 
a certain (contiguous) subword is a classical combinatorial problem that can be 
solved, for example, by the principle of inclusion-exclusion. An approach to this 
question using generating functions is provided by the Goulden-Jackson cluster 
method [SJ [7] , which utilizes only the self-overlaps (or "autocorrelations" ) of the 
word being considered. A natural question is "When do two words have the same 
avoiding generating function?" That is, when are the n-letter words avoiding (re- 
spectively) Wi and W2 equinumerous for all n? The answer is simple: precisely 
when their self-overlaps coincide. For example, the equivalence classes of length-4 
words on the alphabet {0, 1} are as follows. 

equivalence class self-overlap lengths 
{0001, 0011, 0111, 1000, 1100, 1110} {4} 
{0010, 0100, 0110, 1001, 1011, 1101} {1, 4} 

{0101,1010} {2,4} 
{0000,1111} {1,2,3,4} 

In this paper we consider the analogous questions for plane trees. All trees in 
the paper are rooted and ordered. 

Our focus will be on binary trees — trees in which each vertex has or 2 (ordered) 
children. A vertex with children is a leaf, and a vertex with 2 children is an internal 
vertex. A binary tree with n leaves has n — 1 internal vertices, and the number of 
such trees is the Catalan number C„_i. The first few binary trees are depicted in 
Figure [T] We use an indexing for n-leaf binary trees that arises from the natural 
recursive construction of all n-leaf binary trees by pairing each /c-leaf binary tree 
with each (n — fc)-leaf binary tree, for all 1 < fc < n — 1. In practice it will be clear 
from context which tree we mean by, for example, ''ti\ 

Conceptually, a binary tree T avoids a tree pattern t if there is no instance of t 
anywhere inside T. Steyaert and Flajolet |12j were interested in such patterns in 
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Figure 1. The binary trees with at most 5 leaves. 



vertex-labeled trees. They were mainly concerned with the asymptotic probability 
of avoiding a pattern, whereas our focus is on enumeration. However, they establish 
in Section 2.2 that the total number of occurrences of an m-leaf binary tree pattern 
t in all n-leaf binary trees is 

/2n — m\ 
\n — m J 

In this sense, all m-leaf binary trees are indistinguishable; the results of this paper 
refine this statement by further distinguishing m-leaf tree patterns by the number 
of n-leaf trees containing precisely k copies of each. 

We remark that a different notion of tree pattern was later considered by Flajo- 
let, Sipala, and Steyaert [1], in which every leaf of the pattern must be matched by 
a leaf of the tree. Such patterns are only matched at the bottom of a tree, so they 
arise naturally in the problem of compactly representing in memory an expression 
containing repeated subexpressions. The enumeration of trees avoiding such a pat- 
tern is simple, since no two instances of the pattern can overlap: The number of 
n-leaf binary trees avoiding t depends only on the number of leaves in t. See also 
Flajolet and Sedgewick [31 Note III.40]. 

The reason for studying patterns in binary trees as opposed to rooted, ordered 
trees in general is that it is straightforward to determine what it should mean for 
a binary tree to avoid, for example, 

whereas a priori it is ambiguous to say that a general tree avoids 

A- 

Namely, for general trees, 'matches a vertex with i children' for i > 1 could mean 
either 'has exactly i children' or 'has at least i children'. For binary trees, these are 
the same for i = 2, so there is no choice to be made. 

However, it turns out that the notion of pattern avoidance for binary trees in- 
duces a well-defined notion of pattern avoidance for general trees. This arises via 
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the natural bijection /3 between the set of n-leaf binary trees and the set of n- 
vertex trees; using this bijection, one simply translates the problem into the setting 
of binary trees. 

One main theoretical purpose of this paper is to provide an algorithm for com- 
puting the generating function that counts binary trees avoiding a certain tree 
pattern. This algorithm easily generalizes to count trees containing a prescribed 
number of occurrences of a certain pattern, and additionally we consider the num- 
ber of trees containing several patterns each a prescribed number of times. All of 
these generating functions are algebraic. Section |4] is devoted to these algorithms, 
which are implemented in TreePatterns [5], a Mathematica package available 
from the author's website. 

By contrast, another main purpose of this paper is quite concrete, and that is to 
determine equivalence classes of binary trees. We say that two tree patterns s and t 
are equivalent if for all n > 1 the number of n-leaf binary trees avoiding s is equal to 
the number of n-leaf binary trees avoiding t. In other words, equivalent trees have 
the same generating function with respect to avoidance. This is the analogue of Wilf 
equivalence in permutation patterns. Each tree is trivially equivalent to its left- 
right reflection, but there are other equivalences as well. The first few classes are 
presented in Section [3] The appendix contains a complete list of equivalence classes 
of binary trees with at most 6 leaves, from which we draw examples throughout 
the paper. Classes are named with the convention that class m.i is the ith class of 
m-leaf binary trees. 

We seek to understand equivalence classes of binary trees combinatorially, and 
this is the third purpose of the paper. By analogy with words, one might hope 
for a simple criterion such as "s and t are equivalent precisely when the lengths 
of their self-overlaps coincide"; however, although the set of self-overlap lengths 
seems to be preserved under equivalence, this statement is not true, for {1, 1, 2, 2, 5} 
corresponds to both classes 6.3 and 6.7. In lieu of a simple criterion, we look for 
bijections. As discussed in Section |3.5[ in a few cases there is a bijection between 
n-leaf binary trees avoiding a certain pattern and Dyck (n — l)-words avoiding a 
certain (contiguous) subword. In general, when s and t are equivalent tree patterns, 
we would like to provide a bijection between trees avoiding s and trees avoiding t. 
Conjecturally, all classes of binary trees can be established bijectively by top-down 
and bottom-up replacements; this is the topic of Section [5] Nearly all bijections in 
the paper are implemented in the package TreePatterns. 

Aside from mathematical interest, a general study of pattern avoidance in trees 
has applications to any collection of objects related by a tree structure, such as 
people in a family tree or species in a phylogenetic tree. In particular, this paper 
answers the following question. Given n related objects (e.g., species) for which 
the exact relationships aren't known, how likely is it that some prescribed (e.g., 
evolutionary) relationship exists between some subset of them? (Unfortunately, 
it probably will not lead to insight regarding the practical question "What is the 
probability of avoiding a mother-in-law?") Alternatively, we can think of trees 
as describing the syntax of sentences in natural language or of fragments of com- 
puter code; in this context the paper answers questions about the occurrence and 
frequency of given phrase substructures. 
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2. Definitions 

2.1. Avoidance. The more formal way to think of an n- vertex tree is as a partic- 
ular arrangement of n pairs of parentheses, where each vertex is represented by the 
pair of parentheses containing its children. For example, the tree 

is represented by (()(()())). This is the word representation of this tree in the 
alphabet {(,)}• We do not formally distinguish between the graphical represen- 
tation of a tree and the word representation, and it is the latter that is useful in 
manipulating trees algorithmically. {Mathematical s pattern matching capabilities 
provide a convenient tool for working with trees represented as nested lists, so this 
is the convention used by TreePatterns.) 

Informally, our concept of containment is as follows. A binary tree T contains 
t if there is a (contiguous, rooted, ordered) subtree of T that is a copy of t. For 
example, consider 

None of the trees 

contains a copy of t, while each of the trees 
contains precisely one copy of i, each of the trees 

\ \ h/>^ A ^A^^ 4:^ 

contains precisely two (possibly overlapping) copies of t, and the tree 




contains precisely three copies of t. This is a classification of binary trees with at 
most 5 leaves according to the number of copies of t. 

We might formalize this concept with a graph theoretic definition as follows. Let 
i be a binary tree. A copy of t in T is a subgraph of T (obtained by removing ver- 
tices) that is isomorphic to t (preserving edge directions and the order of children). 
Naturally, T avoids t if the number of copies of t in T is 0. 

An equivalent but much more useful definition is a language theoretic one, and 
to provide this we first distinguish a tree pattern from a tree. 

By 'tree pattern', informally we mean a tree whose leaves are "blanks" that 
can be filled (matched) by any tree, not just a single vertex. More precisely, let 
S = {(,)}, and let L be the language on E containing (the word representation 
of) every binary tree. Consider a binary tree r, and let t be the word on the 
three symbols (,),L obtained by replacing each leaf in r by L. We call t 
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the tree pattern of t. This tree pattern naturally generates a language Lt on E, 
which we obtain by interpreting the word t as a product of the three languages 
( = {(}, )={)}, L. Informally, Lt is the set of words that match t. We think of t 
and Lt interchangeably. (Note that a tree is a tree pattern matched only by itself.) 
For example, let 

(()(()())); 



then the corresponding tree pattern is t = (L(LL)), and the language Lt consists 
of all trees of the form (TCUV) ) , where T, U, V are binary trees. 

Let E* denote the set of all finite words on E. The language E*LtE* n L is the 
set of all binary trees whose word has a subword in Lt- Therefore we say that a 
binary tree T contains the tree pattern < if T is in the language E*LtE* HL. We can 
think of this language as a multiset, where a given tree T occurs with multiplicity 
equal to the number of ways that it matches E*LfE*. Then the number of copies 
of t in T is the multipUcity of T in E*LfE* n L. 

Continuing the example from above, the tree 



(()((()())(()()))) 



contains 2 copies of t since it matches E*I/tE* in 2 ways: CTCUV)) with T = 
andU = V= (0 0), and iO(.T(UV))) with T = (0 0) and [/ = F = (). 

Our notation distinguishes tree patterns from trees: Tree patterns are repre- 
sented by lowercase variables, and trees are represented by uppercase variables. To 
be absolutely precise, we would graphically distinguish between terminal leaves O 
of a tree and blank leaves L of a tree pattern, but this gets in the way of speaking 
about them as the same objects, which is rather convenient. 

In Sections |4] and [5] we will be interested in taking the intersection pD q oi tree 
patterns p and q (by which we mean the intersection of the corresponding languages 
Lp and Lg). The intersection of two or more explicit tree patterns can be computed 
recursively: pD L = p, and (piPr) H (qiqr) = ({pi H qi){pr H qr)) ■ 

2.2. Generating functions. Our primary goal is to determine the number a„ 
of binary trees with n vertices that avoid a given binary tree pattern and more 
generally to determine the number an,k of binary trees with n vertices and precisely 
k copies of t. Thus we consider two generating functions associated with t: the 
avoiding generating function 

oo 

Avt (x)^ ^number of vertices in T ^ ^ 

T avoids t n—0 

and the enumerating generating function 

Ell/^ (x y) ^ ^ ^number of vertices in T^number of copies of i in T 

oo oo 
11=0 k=0 

The avoiding generating function is the special case Avf(a;) = En^ f(a;,0). 
Theorem 1. Eni j(a;, j/) is algebraic. 
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The proof is constructive, so it enables us to compute Eni_t(x), and in particular 
Avt{x), for explicit tree patterns. We postpone the proof until Section 4.2 to address 
a natural question that arises: Which trees have the same generating function? 
That is, for which pairs of binary tree patterns s and t are the n-leaf trees avoiding 
(or containing k copies of) these patterns equinumerous? 

We say that s and t are avoiding- equivalent if Avs(a;) = Avtix). We say 
they are enumerating- equivalent if the seemingly stronger condition En^ ,j(a;, y) — 
EiiL,tix,y) holds. We can compute these equivalence classes explicitly by comput- 
ing Avt(x) and EiiiL_tix,y) for, say, all m-leaf binary tree patterns t. In doing this 
for binary trees with up to 7 leaves, one comes to suspect that these conditions are 
in fact the same. 

Conjecture 2. If s andt are avoiding- equivalent, then they are also enumerating- 
equivalent. 

In light of this experimental result, we focus attention in the remainder of the 
paper on classes of avoiding-equivalence, since conjecturally they are the same as 
classes of enumerating-equivalence. 

3. Initial inventory and some special bijections 

In this section we undertake an analysis of small patterns. We determine Avt(a;) 
for binary tree patterns with at most 4 leaves using methods specific to each. This 
allows us to establish the equivalence classes in this range. 

3.1. 1-leaf trees. There is only one binary tree pattern with a single leaf, namely 

t = ' =L. 

Every binary tree contains at least one vertex, so Avt(a;) = 0. The number of 
binary trees with 2n — 1 vertices is C„_i, so 

oo 

EnL{x) =x + x^ + 2a;'^ + 5x^ + 14a;^ + 42a;" + • • • = ^ C„_ia;2"^^ 

n=l 

3.2. 2-leaf trees. There is also only one binary tree pattern with precisely 2 leaves: 

t = A= CLL). 

However, t is a fairly fundamental structure in binary trees; the only tree avoiding 
it is the 1-vertex tree (). Thus Avt(a::) = x, and 

Eni,,t(x, yj — G„_ix y — . 

n=i ■^^y 

3.3. 3-leaf trees. There are C2 = 2 binary trees with 3 leaves, and they are 
equivalent by left-right reflection: 

\ and 

There is only one binary tree with n leaves avoiding 

,A= ((LDL), 

namely the "right comb" (()(()(()(() •••)))). Therefore for these trees 

A-yAx) = X + x^ + x^ + x'^ + x"^ + x^^ ^ = — 

1 — x^ 
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3.4. 4-leaf trees. Among 4-leaf binary trees we find more interesting behavior. 
There are C3 = 5 such trees, pictured as follows. 

They comprise 2 equivalence classes. 

Class 4-1- The first equivalence class consists of the trees 

h = 'O^ and = . 
The avoiding generating function Avt(a;) for each of these trees satisfies 

because the number of n-leaf binary trees avoiding ti is the Motzkin number Af„_i: 

00 

Avt(x) ^x + x^ + 2x^ + 4x^ + 9a;^ + 21a;" H = V M„ 



n=l 

This fact is presented by Donaghey and Shapiro [2, as their final example of 
objects counted by the Motzkin numbers. They provide a bijective proof which we 
reformulate here. Specifically, there is a natural bijection between the set of n-leaf 
binary trees avoiding ti and the set of Motzkin paths of length n — 1 — paths from 
(0, 0) to [n — 1, 0) composed of steps (1, —1), (1, 0), (1, 1) that do not go below the 
X-axis. We represent a Motzkin path as a word on { — 1, 0, 1} encoding the sequence 
of steps under (1, Ay) ^ Ay. 

Let /3 be the usual bijection from n-leaf binary trees to n-vertex general trees 
that operates by contracting every rightward edge. To obtain the Motzkin path 
associated with a binary tree T avoiding ti: 

(1) Let T' — /3{T). No vertex in T' has more than 2 children, since 
and T avoids ti. 

(2) Create a word w on {—1, 0, 1} by traversing T' in depth-first order (i.e., for 
each subtree visit first the root vertex and then its children trees in order); 
for each vertex, record 1 less than the number of children of that vertex. 

(3) Delete the last letter of w (which is —1). 

The resulting word contains the same number of —Is and Is, and every prefix 
contains at least as many Is as —Is, so it is a Motzkin path. The steps are easily 
reversed to provide the inverse map from Motzkin paths to binary trees avoiding 
ti. (For the larger context of this bijection, see Stanley's presentation leading up 
to Theorem 5.3.10 [H]-) 

Class 4-2- The second equivalence class consists of the three trees 
h = h = and t4 = 

and provides the smallest example of nontrivial equivalence. Symmetry gives 
Avt^ix) = Avt^{x). To establish Avt2(a;) = Avt^{x), for each of these trees t we 
give a bijection between n-leaf binary trees avoiding t and binary words of length 
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n — 2. By composing these two maps we obtain a bijection between trees avoiding 
^2 and trees avoiding ^3. 
First consider 

*3 = A- 

If T avoids is, then no vertex of T has four grandchildren; that is, at most one of 
a vertex's children has children of its own. This implies that at each generation at 
most one vertex has children. Since there are two vertices at each generation after 
the first, the nmnber of such 71-leaf trees is 2""^ for n > 2: 

Avt3 (x) = X + x3 + 2x5 + + + 16x" + . . . ^ x + V 2"-2^2„-i ^ . 

11=2 

Form a word w G {0, 1}"^^ corresponding to T by letting the ilh letter be or 1 
depending on which vertex (left or right) on level i + 1 has children. 
Now consider 

A "typical" binary tree avoiding ^2 looks like 




and is determined by the length of its spine and the length of each arm. Starting 
from the root, traverse the internal vertices of a tree T avoiding t2 according to 
the following rule. Always move to the right child of a vertex when the right 
child is an internal vertex, and if the right child is a leaf then move to the highest 
unvisited internal spine vertex. By recording and 1 for left and right movements 
in this traversal, a word w on {0, 1} is produced that encodes T uniquely. We have 
\w\ = n — 2 since we obtain one symbol from each internal vertex except the root. 
Since every word w corresponds to an n-leaf binary tree avoiding t2, there are 2"~^ 
such trees. 

More formally, let a; be a map from binary trees to binary words defined by 
Lo{{TiTr)) = Ki{Tr)K.o{Ti), whcrc 




if r= 

(T) otherwise. 



Then the word corresponding to T is w = lo{T). 

For the inverse map begin with the word ilr) . Then read w left to right. 
When the symbol 1 is read, replace the existing r by ( Or) ; when is read, replace 
the existing r by () and the existing / by {Ir) . After the entire word is read, replace 
the remaining / and r with () . One verifies that T has n leaves. The tree T avoids 
^2 because the left child of an r vertex never has children of its own. 
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3.5. Bijections to Dyck words. Wc mention that in some cases the set of trees 
avoiding a pattern is in bijection to the set of Dyck words avoiding a certain sub- 
word. For example, trees avoiding a pattern in class 4.1 are in bijection to Dyck 



words avoiding the subword 000. Recall P from Section [3^ For we have 

whose corresponding Dyck word is 000111, and /3 has the feature that T contains 
tr, if and only if the Dyck word corresponding to /3{T) contains 000. 

In general there is a bijection between n-leaf binary trees avoiding t and {n — 1)- 
Dyck words avoiding w whenever w is a characteristic feature of I3{t), that is, some 
feature of the tree that is preserved locally by /3. 

For example, for t2 in class 4.2 we observe that 



/3 A = A = 010011, 



and containing 100 as a subword is necessary and sufficient for the corresponding 
tree to contain ^2- Thus binary trees avoiding a pattern in class 4.2 are in bijection 
to Dyck words avoiding 100. Other bijections can be found similarly: Classes 5.2, 
5.3, 6.3, and 6.6 correspond to the words 1100, 1000, 11000, and 10000. 

Notes on sequences counting Dyck words avoiding a subword have been con- 
tributed by David Callan and Emeric Deutsch to the Encyclopedia of Integer Se- 
quences [in]. The subject appears to have begun with Deutsch [U Section 6.17], 
who enumerated Dyck words according to the number of occurrences of 100. Sa- 
pounakis, Tasoulas, and Tsikouras [S] have considered additional subwords. Via 
the bijections just described, their results provide additional derivations of the gen- 
erating functions Avt(a;). 



4. Algorithms 

In this section we provide algorithms for computing algebraic equations satisfied 
by Avtix), En^ f (a;, y), and the more general Eni_j,j^...^pj. (xl, a;pi , . . . ,a;pj.) defined 



in Section 4.3 Computing Avt{x) or EnL_t(x, y) for all m-leaf binary tree patters t 
allows one to automatically determine the equivalence classes given in the appendix. 

We draw upon the notation introduced in Section |2.1| In particular, the inter- 
section pOp' of two tree patterns plays a central role. Recall that Lp is the set of 
trees matching p at the top level. 

The depth of a vertex in a tree is the length of the minimal path to that vertex 
from the root, and depth(T) is the maximum vertex depth in the tree T. 

4.1. Avoiding a single tree. Fix a binary tree pattern t we wish to avoid. For a 
given tree pattern p, we will make use of the generating function 

weight (p) = weight (Lp) := ^ weight (T), 

TeLp 

where 

if T avoids t 
if T contains t. 



weight (T) 
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The case t — L was covered in Section [XTj so we assume t ^ L. Then t = ititr) 
for some tree patterns ti and tj.. Since (TiTj.) matches t precisely when T; matches 
ti and Tr matches tr, we have 

(1) weight{(pipr)) = 

X ■ (weight (pi) • weight (pr) — weight (pj n ij) • weight (p^ Hi,-)). 

The coefficient x is the weight of the root vertex of (piPr) that we destroy in 
separating this pattern into its two subpatterns. 

We now construct a polynomial (with coefficients that are polynomials in x) that 
is satisfied by Av((x) = weight (i), the weight of the language of binary trees. The 
algorithm is as follows. 

Begin with the equation 

weight(L) = weight(()) + weight((LL)). 

The variable weight((LD) is "new"; we haven't yet written it in terms of other 
variables. So use Equation ([T]) to rewrite weight((LD). For each expression 
weight(p n p') that is introduced, we compute the intersection p D p' . This al- 
lows us to write weight(pnp') as weight(g) for some pattern q that is simply a word 
on {(, ),L} (i.e., does not contain the n operator). 

For each new variable weight (g), we obtain a new equation by making it the left 
side of Equation (jlj, and then as before we eliminate H by explicitly computing 
intersections. 

We continue in this manner until there are no new variables produced. This must 
happen because depth(pnp') = max(depth(p), depth(p')), so since there are only 
finitely many trees that are shallower than i, there are only finitely many variables 
in this system of polynomial equations. 

Finally, we compute a Grobner basis for the system in which all variables ex- 
cept weight(()) = x and weight(L) = Avt(a;) are eliminated. This gives a single 
polynomial equation in these variables, establishing that Avt(a;) is algebraic. 

Let us work out an example. We use the graphical representation of tree patterns 
with the understanding that the leaves are blanks. Consider the tree pattern 




= (LiLdLDL))) 



from class 5.2. The first equation is 

weight ( • ) = X + weight {.'.) . 

We have ti — • and tr = ' /\, so Equation 

weight (.".) — X ■ (weight ( • ) • weight ( • ) — weight ( ■ n • ) • weight ( • n ^<N)) 

= X ■ (weight ( • )^ — weight ( • ) • weight (■^\)) 

since L (1 p = p for any tree pattern p. The variable weight('^.) = weight(tr) is 
new, so we put it into Equation ([T]): 

weight(' .;■.) = X ■ (weight( • ) • weight(/"-) — weight( • n • ) • weight(/^ n ^^)) 
= X ■ (weight( • ) • weight(_«]-) — weight( • ) • weight( aX)) . 



1 ) gives 
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There are two new variables: 

weight(/''') = X ■ (weight(/.) • weight( • ) — weight(.". n • ) • weight( • n '/'-.)) 
= X ■ (weight (.'.) • weight ( ■ ) — weight (.'.) • weight ('C^)) ; 
weight( a'^.) = X ■ (weight(.\) • weight(_']') — weight(A n • ) • weight(/^' fl ys)) 
= X ■ (weight(.'.) • weight(//) — weight(.'.) • weight (. K^)) . 
We have no new variables, so we eliminate the four auxiliary variables 

weight(A), weight(_.|-), weight(|^^\), weight(/-r<".) 
from this system of five equations to obtain 

weight( • f - (x^ - if weight( •) - x {x^ - 1) = 0. 

4.2. Enumerating with respect to a single tree. To prove Theorem [T] we 
make a few modifications in order to compute EnL_t(x, y) instead of Avt{x). Again 

weight (p) := ^ weight (T), 

but now weight(r) rcnumher of vertices in T ynumhcr o{ copies of t in T f^j. g^jj -^g 

modify Equation ([ij to become 

(2) weight((pipr)) = 

X ■ (weight(p;) • weight(pr) + {y ^ ^) ■ weight(p; n t;) • weight(pr H t^)) 

since in addition to accounting for the trees that avoid t we also account for those 
that match t, in which case y is contributed. 

The rest of the algorithm carries over unchanged, and we obtain a polynomial 
equation in x, y, and EnL.t(a;, y) = weight(L). 

4.3. Enumerating with respect to multiple trees. A more general question 
is the following. Given several binary tree patterns pi, . . . ,pk, what is the number 

of binary trees containing precisely hq vertices, ni copies of pi, . . . , Uk 
copies of pfc? We consider the enumerating generating function 



L ^pi -^Pk 

Teh 

oo oo oo 

.,nk-^L -^pi -^Pk ' 
no— Oni— nk—0 



where po ^ L and a; is the number of copies of pi in T. (We need not assume 
that the pi are distinct.) This generating function can be used to obtain infor- 
mation about how correlated a family of tree patterns is. We have the following 
generalization of Theorem [T] 

Theorem 3. EnL^p^^,„^p^{xL, Xp^ , . . . , Xpi^ ) is algebraic. 
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Keeping track of multiple tree patterns pi, . . . ,pk is not much more complicated 
than handling a single pattern, and the algorithm for doing so has the same outline. 
Let 

weight (p) :— weight (T) 

TeLp 

with 

where again is the number of copies of pi in T. Let d = maxi<i<fc depth(pi). 
First we describe what to do with each new variable weight(5) that arises. The 
approach used is different than that for one tree pattern; in particular, we do not 
make use of intersections. Consequently, it is less efficient. 

Let I be the number of leaves in q. If T is a tree matching q, then for each leaf 
L of q there are two possibilities: Either L is matched by a terminal vertex () in 
T, or L is matched by a tree matching (LL) . For each leaf we make this choice 
independently, thus partitioning the language Lq into 2' disjoint sets represented 
by 2' tree patterns that are disjoint in the sense that each tree matching q matches 
precisely one of these patterns. For example, partitioning the pattern (LL) into 2^ 
patterns gives 

weight((iL)) =weight((()())) + 

weight( ( (LL) ) ) + weight( ( (LL) () ) ) + weight( ( (LL) (LL) ) ). 

We need an analogue of Equation ^ for splitting a pattern (piPr ) into the two 
subpatterns pi and Pr- For this, examine each of the 2' patterns that arose in 
partitioning q. For each pattern p — (piPr) whose language is infinite (that is, the 
word p contains the symbol L) and has depth(p) > d, rewrite 

weight (p) = weight (p;) • weight (p^) • J]^ Xp,, 

0<i<fc 
p matches pi 

where 'p matches pi ' means that every tree in Lp matches pi (so Lp C Lp. ) . If Lp 
is infinite but depth(p) < d, keep weight (p) intact as a variable. 

Finally, for all tree patterns p whose language is finite (i.e., p is a tree), rewrite 

weight (p) = Yl 2;™'"^"' °^ °^ P. 

0<i<fc 

The algorithm is as follows. As before, begin with the equation 

weight(L) — weight(()) + wcight((LL)). 

At each step, take each new variable weight(g) and obtain another equation by 
performing the procedure described: Write it as the sum of 2' other variables, 
split the designated patterns into subpatterns, and explicitly compute the weights 
of any trees appearing. Continue in this manner until there are no new variables 
produced; this must happen because we break up weight (p) whenever depth(p) > d, 
so there are only finitely many possible variables. Eliminate from this system of 
polynomial equations all but the fc + 2 variables weight(()) = Xl-, Xp-^, . . . , Xp^, and 
weight (L) = Eni^pj^..._pj^(a;L, Xpi , • . • , Xpi^ ) to obtain a polynomial equation satisfied 
by En^ (x/, , Xp^ j ■ ■ • ; Xp^, ) . 
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5. Replacement bijections 

In this section we address the question of providing systematic bijective proofs of 
avoiding-equivalence. Given two equivalent binary tree patterns s and t, we would 
like to produce an explicit bijection between binary trees avoiding s and binary trees 
avoiding t. It turns out that this can often be achieved by structural replacements 
on trees. We start by describing an example in full, and later generalize. 

5.1. An example replacement bijection. Consider the trees 

h = ^^^\ and = ^ 

in class 4.2. The idea is that since n-leaf trees avoiding t2 are in bijection to rt-leaf 
trees avoiding ts , then somehow swapping all occurrences of these two tree patterns 
should produce a bijection. However, since the patterns may overlap, it is necessary 
to specify an order in which to perform the replacements. A natural order is to 
start with the root and work down the tree. More precisely, a top-down replacement 
is a restructuring of a tree T in which we iteratively apply a set of transformation 
rules to subtrees of T, working downward from the root. 
Take the replacement rule to be 



abed ' ) 
a b 

where the variables represent trees attached at the leaves, rearranged according to 
the permutation 3124. Begin at the root: If T itself matches the left side of the 
rule, then we restructure T according to the rule; if not, we leave T unchanged. 
Then we repeat the procedure on the root's (new) children, then on their children, 
and so on, so that each vertex in the tree is taken to be the root of a subtree which 
is possibly transformed by the rule. For example, 




shows the three replacements required to compute the image (on the right) of a 
tree avoiding t2- The resulting tree avoids ^3. 

This top-down replacement is invertible. The inverse map is a bottom-up replace- 
ment with the inverse replacement rule. 



,• bead 
b c 

Rather than starting at the root and working down the tree, we apply this map by 
starting at the leaves and working up the tree. 

We now show that the top-down replacement is in fact a bijection from trees 
avoiding t2 to trees avoiding ^3. It turns out to be the same bijection given in 
Section 3.4 via words in {0, 

Assume T avoids t2 ; we show that the image of T under the top-down replace- 
ment avoids t-i . It is helpful to think of T as broken up into (possibly overlapping) 
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"spheres of influence" — subtrees which are maximal with respect to the replace- 
ment rule in the sense that performing the top-down replacement on the subtree 
does not introduce instances of the relevant tree patterns containing vertices out- 
side of the subtree. It suffices to consider each sphere of influence separately. A 
natural focal point for each sphere of influence is the highest occurrence of ^3. We 
verify that restructuring this ^3 to ^2 under the top-down replacement produces no 
^3 above, at, or below the root of the new t2 in the image of T. 

above: Since ^3 has depth 2, can occur at most one level above the root of the 
new ^2 while overlapping it. Thus it suffices to consider all subtrees with 
^3 occurring at level 1. There are two cases. 




abed abed 



The first case does not avoid t2 , so it does not occur in T. The second case 
may occur in T. However, we do not want the subtree itself to match 
(because we assume that the at level 1 is the highest ^3 in this sphere of 
influence) , so we must have e = ( ) . Thus this subtree is transformed by 
the top-down replacement as 




The image does not match at the root, so does not appear above the 
root of the new t2. 

at: Since T avoids ^2 , every subtree in T matching ^3 in fact matches the pattern 
( (LL) ( W ) . Such a subtree is restructured as 

ab d ^ ^ 

under the replacement rule, and the image does not match (because 
c = () is terminal). Therefore the new t2 cannot itself match ^3. 
below: A general subtree matching ^3 and avoiding t2 is transformed as 




-)■•••. 



a b 



Clearly t'i can only occur in the image at or below the subtree iab) . How- 
ever, since iah) is preserved by the replacement rule, any transformations 
on iah) can be considered independently. That is, iah) is the top of a 
different sphere of influence, so we need not consider it here. We conclude 
that <3 does not occur below the root of the new t2- 
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If we already knew that t2 and are equivalent (for example, by having com- 



puted Avt{x) as in Section 4.1 ). then we have now obtained a bijective proof of their 
equivalence. Otherwise, it remains to show that if T avoids ^3, then performing the 
bottom- up replacement produces a tree that avoids this can be accomplished 
similarly. 

5.2. General replacement bijections. A natural question is whether for any 
two equivalent binary tree patterns s and t there exists a sequence of replacement 
bijections and left-right reflections that establishes their equivalence. For tree pat- 
terns of at most 7 leaves the answer is "Yes", which perhaps suggests that these 
maps suffice in general. 

Conjecture 4. // s and t are equivalent, then there is a sequence of top-down 
replacements, bottom-up replacements, and left-right reflections that produces a bi- 
jection from binary trees avoiding s to binary trees avoiding t. 

In this section we discuss qualitative results regarding this conjecture. 

Given two TO-leaf tree patterns s and t, one can ask which permutations of 
the leaves give rise to a top-down replacement that induces a bijection from trees 
avoiding s to trees avoiding t. Most permutations arc not viable. Candidate permu- 
tations can be found experimentally by simply testing all m! permutations of leaves 
on a set of randomly chosen binary trees avoiding s; one checks that the image 
avoids t and that composing the top-down replacement with the inverse bottom-up 
replacement produces the original tree. This approach is feasible for small m, but 
it is slow and does not provide any insight into why certain trees are equivalent. A 
question unresolved at present is to efhciently find all such bijections. 

We return briefly to the replacement rule of Section [5. 1| to mention that a minor 
modification produces a bijection on the full set of binary trees. Namely, take the 
two replacement rules 



abed \ / ^ bead 

a b be 

Again we perform a top-down replacement, now using both rules together. That is, 
if a subtree matches the left side of either rule, we restructure it according to that 
rule. Of course, it can happen that a particular subtree matches both replacement 
rules, resulting in potential ambiguity; in this case which do we apply? Well, if 
both rules result in the same transformation, then it does not matter, and indeed 
with our present example this is the case. To show this, it suffices to take the 
intersection t2 n ^3 of the two left sides and label the leaves to represent additional 
branches that may be present: 




Pi Pi /\ Pi 
Pi Pi 



Now we check that applying each of the two replacement rules to this tree produces 
the same labeled tree, namely 




Pi Pi/ \ Ps 

Pi Pi 
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Therefore we need not concern ourselves with which rule is applied to a given 
subtree that matches both. Since the replacement rules agree on their intersection, 
the top-down replacement is again invertible and is therefore a bijection from the 
set of binary trees to itself. By the examination of cases in Section [5.1[ this bijection 
is an extension of the bijection between binary trees avoiding t2 and binary trees 
avoiding t^. 

Thus we may choose from two types of bijection when searching for top-down 
replacement bijections that prove avoiding-equivalence. The first type is from bi- 
nary trees avoiding s to binary trees avoiding t, using one rule for the top-down 
direction and the inverse for the bottom-up direction; these bijections in general 
do not extend to bijections on the full set of binary trees. The second type is a 
bijection on the full set of binary trees, using both rules in each direction, that 
induces a bijection from binary trees avoiding s to binary trees avoiding t. 

We conclude with a curious example in which two tree patterns can only be 
proven equivalent by a two-rule bijection that does not involve them directly. The 
trees 

*7 = ^"^^ ^ 

in class 6.5 are avoiding-equivalent by the permutation 126345, but neither 

nor its left -right reflection has an equivalence-proving permutation to tr, tu, or 
their left-right reflections. Thus, this equivalence cannot be established by a bijec- 
tion that swaps 6-leaf tree patterns. However, it can be established by a bijection 
that swaps 4-leaf tree patterns: The previously mentioned bijection consisting of 
the two replacement rules 

abed bead 
a b be 

induces a top-down replacement bijection from trees avoiding t-^ to trees avoiding 
tn- The reason is that <7 and tn are formed by two overlapping copies of the 
class 4.2 trees 

^ and 

respectively, and that and tu are mapped to each other under this bijection. 
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Appendix. Table of equivalence classes 



This appendix lists equivalence classes of binary trees with at most 6 leaves. 
Left-right reflections are omitted for compactness. For each class we provide a 
polynomial equation satisfied by / = EnL_t(a;,y); an equation satisfied by Avt(a;) 
is obtained in each case by letting y — 0- 

The data was computed by the Mathematica package TreePatterns 8J using 
Singular via the interface package by Manuel Kauers and Viktor Levandovskyy 
[B]. Pre-computed data extended to 8-leaf binary trees is now also available in 
TreePatterns. The number of equivalence classes of m-leaf binary trees for 
m = 1, 2, 3, . . . is 1, 1, 1, 2, 3, 7, 15, 44, ... . 

Class 1.1 (1 tree). 



^yf^ - f + xy = 



Class 2.1 {I tree). 



A 



xyf -f + x = 



Class 3.1 (2 trees). 



xyf + {-x\y-l)-l)f + x = 



Class 4-1 (2 trees). 




{xy - - 1)) /2 + - 1) - 1) / + x = 



Class 4-2 (3 trees). 




xyf + {-2x\y - 1) - l) / + {x^{y - 1) + x) = 



Class 5.1 (2 trees). 




=^Hy - 1)/^ + i^y - ^^{y - i)) /' + {-^\y - i) - i) / + ^ = o 



Class 5.2 (10 trees). 




{xy - x^{y - 1)) f + {x^ {x^ - 2) - 1) - l) / + {x-\y - 1) + x) = 
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Class 5.3 (2 trees). 




^yf + {-3x^(y - 1) - 1) + {sx^iy -i) + x)f- x\y - i) = o 
Class 6.1 (2 trees). 




-x^(y - l)f - x\y - l)f + {xy - x^'iy - 1)) f + {~x\y -l)-l)/ + x = 

Class 6.2 (8 trees). 




- x^{y - l)f + X (x2 (x2 - 1) (y - 1) + y) f + 

{x^ - 2) {y - 1) - 1) / + (x'3(y - 1) + x) 

Class 6.3 (14 trees). 

%^ <^ ^ ^ ^ A 

X {x^ (x2 - 2) - 1) + y) f + {2x^ {x^ - l) (j/ - 1) - l) / + (x^d/ - 1) + x) = 

CZass f).^ (8 trees). 




{xy - x'^iy - 1)) f + (x2 (2^2 _ 3) (j, - 1) - l) f + 

{-x\y - 1) + 3x3(2; -l) + x)f- x\y - 1) 

Class 6.5 (6 trees). 

(xy - 2x3(y - 1)) f + (x^ (Sx^ - 2) (y - 1) - l) / + {-x'>(y - 1) + x3(y - 1) + x) = 

Class 6.6 (2 trees). 




-xy/4 + (4x2{y - 1) + l) + (-6x^(2/ - 1) - x) + 4x*(j/ - 1)/ - x\y - I) = 



Pattern avoidance in binary trees 



19 



Class 6.7 (2 trees). 




{x^{y ^l)-y){y- l)f + {--2x'{y - l)^ + x^{y - l){3j/ - 2) - x^{y - 1) + xy) f + 
(x2 - 1) - -ix^y -l) + x^-2) (y - 1) - l) / + (x'' {y - l)^ + - I) + x) = 
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